Some Performance Issues Associated with CEWES MSRC Distributed-Memory Architectures
Abstract
Views, opinions, and/or findings contained in this report are those of the author(s) and should not be construed as an official Department of Defense position, policy, or decision unless so designated by other official documentation.

Some typical performance issues associated with sparse matrix codes that use the Message Passing Interface on distributed memory machines will be discussed. In particular, average sustained floating point performance will be examined as a function of problem size and number of processors for an unstructured fluid mechanics solver. These issues will be examined within the context of the iterative solution of nonsymmetric linear systems of equations, such as those that arise from finite volume approximations to convection-diffusion problems. A brief description of the parallel CGSTAB (stabilized bi-conjugate gradient) iterative method will be followed by comparisons of its performance on the IBM SP, SGI Origin 2000, and Cray T3E at the Corps of Engineers Waterways Experiment Station Major Shared Resource Center (CEWES MSRC). In general, these machines have chips that are rated at several hundred Mflops, but the performance actually obtained in practice is much lower. Sustained performance is a strong function of cache and communication performance. The aim of this report is not to determine which machine is "better" for certain classes of problems, but rather to illustrate performance trade-offs that users can expect to address in most problems run on these machines.
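For readers unfamiliar with the method named in the abstract, the following is a minimal serial sketch of an unpreconditioned stabilized bi-conjugate gradient iteration in pure Python. It is not the report's parallel implementation. The function names (`matvec`, `dot`, `bicgstab`), the row-list sparse storage, and the small test matrix (a 1D upwind convection-diffusion stencil with made-up coefficients) are all assumptions chosen for illustration.

```python
# Illustrative serial sketch only; names, storage format, and test problem
# are assumptions, not taken from the report.

def matvec(rows, x):
    """Sparse matrix-vector product; rows[i] is a list of (column, value) pairs."""
    return [sum(val * x[col] for col, val in row) for row in rows]


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def bicgstab(rows, b, tol=1e-10, max_iter=200):
    """Solve A x = b for nonsymmetric A; returns (x, iterations used)."""
    n = len(b)
    x = [0.0] * n
    r = b[:]            # initial residual b - A x with x = 0
    r_hat = r[:]        # fixed shadow residual
    rho = alpha = omega = 1.0
    v = [0.0] * n
    p = [0.0] * n
    b_norm = dot(b, b) ** 0.5 or 1.0
    for it in range(1, max_iter + 1):
        rho_new = dot(r_hat, r)
        if rho_new == 0.0:
            raise ArithmeticError("breakdown: rho is zero")
        beta = (rho_new / rho) * (alpha / omega)
        p = [ri + beta * (pi - omega * vi) for ri, pi, vi in zip(r, p, v)]
        v = matvec(rows, p)
        denom = dot(r_hat, v)
        if denom == 0.0:
            raise ArithmeticError("breakdown: (r_hat, v) is zero")
        alpha = rho_new / denom
        s = [ri - alpha * vi for ri, vi in zip(r, v)]
        if dot(s, s) ** 0.5 / b_norm < tol:
            # Converged after the first half-step.
            x = [xi + alpha * pi for xi, pi in zip(x, p)]
            return x, it
        t = matvec(rows, s)
        omega = dot(t, s) / dot(t, t)   # stabilizing least-squares step
        x = [xi + alpha * pi + omega * si for xi, pi, si in zip(x, p, s)]
        r = [si - omega * ti for si, ti in zip(s, t)]
        rho = rho_new
        if dot(r, r) ** 0.5 / b_norm < tol:
            return x, it
    return x, max_iter


# Hypothetical test problem: 1D upwind convection-diffusion stencil,
# diffusion weight 1.0 and convection weight 0.5 (nonsymmetric, tridiagonal).
n = 20
rows = []
for i in range(n):
    row = [(i, 2.5)]
    if i > 0:
        row.append((i - 1, -1.5))
    if i < n - 1:
        row.append((i + 1, -1.0))
    rows.append(row)
b = [1.0] * n

x, iters = bicgstab(rows, b)
residual = max(abs(bi - ai) for bi, ai in zip(b, matvec(rows, x)))
```

Each iteration costs two sparse matrix-vector products and several inner products. In a message-passing version, the products would typically require exchanging boundary values between neighbouring processors and the inner products would require global reductions, which is one reason sustained performance depends on communication as well as cache behaviour.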
Related resources
CEWES MSRC/PET TR/97-02 Some Performance Issues Associated with CEWES MSRC Scalable Architectures
Some typical performance issues associated with sparse matrix codes which use the Message Passing Interface and distributed memory machines will be discussed. In particular, average, sustained, floating point performance will be examined as a function of problem size and number of processors for an unstructured fluid mechanics solver. These issues will be examined within the context of the iterativ...
A High Performance Parallel IP Lookup Technique Using Distributed Memory Organization and ISCB-Tree Data Structure
The IP Lookup Process is a key bottleneck in routing due to the increase in routing table size, increasing traffic and migration to IPv6 addresses. The IP address lookup involves computation of the Longest Prefix Matching (LPM), for which existing solutions, such as BSD Radix Tries, scale poorly when traffic in the router increases or when employed for IPv6 address lookups. In this paper, we describe a ...
L2 Learners’ Strategy Preference in Metaphorical Test Performance: Effects of Working Memory and Cognitive Style
Although investigating the factors that influence test scores is important, a majority of stakeholders show a paucity of attention towards individual learner differences due to having large classes of L2 learners. This study sought to explore the possible effect of working memory and cognitive style on L2 learners’ metaphorical test performance. The study was conducted in 2 phases. The first ph...
A Comparative Study of Distributed Shared Memory
In this research, the various issues that arise in the design and implementation of distributed shared memory (DSM) systems are examined. This work has been motivated by two observations: distributed systems will continue to become popular, and will be increasingly used for solving large computational problems; and the shared memory paradigm is attractive for programming large distributed systems be...